Recombinant acquired immune deficiency syndrome (AIDS) viral envelope protein and method of
testing for AIDS



Recombinant envelope proteins of the etiologic
agent of acquired immune deficiency syndrome
(AIDS) useful in diagnosis and therapy of AIDS and a
method for their preparation are described. Proviral
DNA is transferred into a host cell after engineering
into an expression vector which produces the
envelope protein. A method of testing human blood for
the presence of antibodies to the AIDS virus using
the AIDS envelope protein is also described.



This invention relates to a protein, designated 
env AIDS, which is the viral envelope protein of the 
etiologic agent for acquired immune deficiency 
syndrome, the expression vector which encodes 
env AIDS, the production of env AIDS using recom- 
binant DNA technology and a method for detecting 
the presence of AIDS antibodies in human blood. 

Background of the Invention 

From 1981 to date, there have been more than
eight thousand (8,000) people diagnosed as having
acquired immune deficiency syndrome (AIDS)
[N.Y. Times, A-11, January 11, 1985]. AIDS has
been characterized by the onset of severe
opportunistic infections secondary to an effect on the
body's immune system [Gottlieb, M.S. et al.,
"Pneumocystis Carinii Pneumonia and Mucosal
Candidiasis in previously healthy homosexual men:
evidence of a new acquired cellular
immunodeficiency", N. Eng. J. Med. 305, 1426-1431
(1981)]. The disease has been found in male
homosexuals, patients receiving blood products,
intravenous drug addicts, and individuals originating
from Haiti and Central Africa [Piot, P. et al.,
"Acquired immunodeficiency syndrome in a
heterosexual population in Zaire", Lancet II, 65-69
(1984)]. The causative agent was suspected to be
of viral origin as the epidemiological pattern of
AIDS was consistent with a transmissible disease.
At least three (3) retroviruses have been isolated
from cultured T-cells of several patients with AIDS,
or from white blood cells of persons at risk for the
disease. A novel human retrovirus called
lymphadenopathy-associated virus (LAV) was
discovered and its properties were consistent with its
etiological role in AIDS. That virus was isolated
from a patient with lymphadenopathy and hence
the name [Montagnier, L. et al., "A New Human
T-lymphotropic retrovirus: characterization and possible
role in lymphadenopathy and acquired immune
deficiency syndromes", in Human T-Cell
Leukemia/Lymphoma Virus, R.C. Gallo, M. Essex
and L. Gross, eds. (Cold Spring Harbor, N.Y.: Cold
Spring Harbor Laboratory) pp. 363-370]. Other
human retroviruses, specifically two subgroups of the
human T-cell leukemia/lymphoma/lymphotropic
virus, types I and III, have been isolated [HTLV-I:
Poiesz, B.J. et al., "Detection and isolation of type
C retrovirus particles from fresh and cultured
lymphocytes of a patient with cutaneous T-cell
lymphoma", PNAS (USA) 77, 7415-7419 (1980);
HTLV-III: Popovic, M. et al., "Detection, isolation and
continuous production of cytopathic retroviruses
(HTLV-III) from patients with AIDS and pre-AIDS",
Science 224, 497-500 (1984)]. Still another virus,
the AIDS-associated retrovirus (ARV), was
proposed as the causative agent [Levy, J.A. et al.,
"Isolation of lymphocytopathic retroviruses from
San Francisco patients with AIDS", Science 225,
840-842 (1984)]. Both the HTLV-III and ARV
retroviruses display biological and sero-epidemiological
properties similar to LAV [Levy, J.A. et al., supra;
Popovic, M. et al., supra]. As seen from the above,
at least three (3) retroviruses have been postulated
as the etiologic agent of AIDS: LAV; ARV; and
HTLV subtypes I and III.

LAV, HTLV III and ARV-II genomes have been
molecularly cloned [Schüpbach, J. et al.,
"Serological analysis of a subgroup of human
T-lymphotropic retroviruses (HTLV-III) associated with
AIDS", Science 224, 503-505 (1984); Alizon, M. et
al., "Molecular Cloning of lymphadenopathy-associated
virus", Nature 312, 757-760 (1984)]. The
complete nucleotide sequence of the proviral
genome of LAV, ARV and HTLV III has been
determined [Ratner, L. et al., "Complete nucleotide
sequence of the AIDS virus, HTLV III", Nature 313,
277-284 (1985); Sanchez-Pescador, R. et al.,
"Nucleotide sequence and expression of an
AIDS-associated retrovirus (ARV-2)", Science 227, 484-492
(1985); Wain-Hobson, S. et al., "Nucleotide
sequence of the AIDS virus, LAV", Cell 40, 9-17
(1985)].

One reason for the difficulty in determining the
etiologic agent of AIDS was the reactivity of
various retroviral antigens with serum samples from
AIDS patients. For example, serum samples from
AIDS patients have been shown to react with
antigens of HTLV I and HTLV III [HTLV-I: Essex, M. et
al., "Antibodies to Cell Membrane Antigens
Associated with Human T-Cell Leukemia Virus in
Patients with AIDS", Science 220, 859-862 (1983);
HTLV-III: Sarngadharan, M.G. et al., "Antibodies
Reactive With Human T-Lymphotropic Retroviruses
(HTLV-III) in the Serum of Patients With AIDS",
Science 224, 506-508 (1984)]. Envelope gene
products of HTLV demonstrated antigenicities
cross-reactive with antibodies in sera from adult T-cell
leukemia patients [Kiyokawa, T. et al., "Envelope
proteins of human T-cell leukemia virus: Expression
in Escherichia coli and its application to studies
of env gene functions", PNAS (USA) 81, 6202-6206
(1984)]. Adult T-cell leukemia (ATL) differs
from acquired immune deficiency syndrome
(AIDS) in that HTLV-I causes T-cell malignancies,
that is, uncontrolled growth of T-cells. In AIDS, rather
than cell growth there is cell death. In fact this
cytopathic characteristic of HTLV III was critical to
determining ultimately the specific retroviral origin
of the disease. Thus the etiologic agent of AIDS
was isolated by use of immortalized human
neoplastic T cell lines (HT) infected with the cytopathic
retrovirus characteristic of AIDS, isolated from
AIDS-afflicted patients. Seroepidemiological assays using
this virus showed a complete correlation between
AIDS and the presence of antibodies to HTLV III
antigens [Sarngadharan, M.G. et al., supra;
Schüpbach, J. et al., supra]. In addition, nearly
85% of patients with lymphadenopathy syndrome
and a significant proportion of asymptomatic
homosexual men in AIDS endemic areas were also
found to carry circulating antibodies to HTLV III.
Taken together, all these data indicate HTLV III to
be the etiologic agent for AIDS.

Until the successful culturing of AIDS virus 
using H-9 cell line [PCT application, publication no. 
WO 85/04897] the env AIDS protein of the AIDS 
virus had not been isolated, characterized or syn- 
thesized. This in major part is due to the fact that 
the virus is cytopathic and thus isolation of the 
virus was not possible [Popovic, M. et al., supra]. 
Once the human T-cell line resistant to the 
cytopathic effects of the virus was discovered, a 
molecular clone of proviral DNA could be achieved. 

The need for a sensitive and rapid method for 
the diagnosis of AIDS in human blood and its 
prevention by vaccination is very great. Virtually all 
the assays/tests presently available are fraught with 
errors. In fact the Center for Disease Control
(CDC) has indicated that presently available tests
be used solely for screening units of blood for
antibody to HTLV III. The CDC went further by
stating that the presently available ELISA tests
cannot be used for general screening of high risk
populations or as a diagnostic test for AIDS
[Federal Register 50(48), 9909, March 12, 1985].
The errors have been traced to the failure to use a 
specific antigenic protein of the etiologic agent for 
AIDS. The previously used proteins were derived 
from a viral lysate. Since the lysate is made from 
human cells infected with the virus, i.e. the cells 
used to grow the virus, the lysate will contain 
human proteins as well as viral proteins. Thus 
preparation of a pure antigen of viral protein is very 
difficult. The antigen used produced both false
positive and false negative results [Budiansky, S.,
"AIDS Screening, False Test Results Raise
Doubts", Nature 312, 583 (1984)]. The errors
caused by the use of such lysate proteins/peptides can
be avoided by using a composition for binding
AIDS antibodies which is substantially free of the
non-AIDS specific proteins. Compositions that are
substantially pure AIDS envelope protein can be
used as antigens.

The AIDS envelope protein of the instant invention
has been established to have conserved
epitopes which permit its use to screen for, diagnose
and/or prevent by vaccination the infection by AIDS
virus. The instant invention demonstrates that the
envelope protein with its conserved epitopes
includes all the variants which have been claimed as
the sole etiologic agent.

The envelope AIDS protein of the present invention
may be produced by conventionally known
methods. The processes by which the novel protein
may be produced can be divided into three
groups: (1) chemical synthesis; (2) preparation of a
gene prepared by chemical synthesis which is
inserted into a host and a protein is produced by the
host; and (3) a corresponding gene obtained
biotechnically is inserted into a host and a protein
is produced by the host.

In one embodiment of this invention,
recombinant DNA techniques are utilized by which env
AIDS DNA from a natural source is introduced into
a cell to produce the env AIDS protein. One method
of obtaining DNA which encodes env AIDS is to
read the genetic code in reverse and synthesize an
oligodeoxynucleotide which should encode the
env AIDS amino acid sequence. As the env protein
has not been isolated or characterized this
approach cannot be pursued.

Alternatively gene expression can be obtained
using recombinant DNA technology if DNA isolated
from natural sources is used instead of synthetic
DNA.

Summary of the Invention 

This invention is directed to the engineering of
the HTLV III env gene into suitable expression vectors;
transformation of host organisms with such
expression vectors; and production of envelope AIDS
protein (env AIDS) by culture of such transformed
cells. Another aspect of the present invention
relates to the isolation and use of the resulting
recombinant env AIDS protein.

Another aspect of the present invention is the
identification and determination of the proviral DNA
sequence. More specifically, this aspect of the
invention relates to determination and comparison of
the proviral nucleotide sequence of the envelope
genes of the purported etiologic agents of AIDS, i.e.
lymphadenopathy-associated virus (LAV),
AIDS-associated retrovirus (ARV) and the human T-cell
leukemia/lymphoma/lymphotropic virus type III
(HTLV III).

A further aspect of this invention relates to a
diagnostic method for testing human blood for the
presence of antibodies to the env AIDS protein.
This aspect of the invention overcomes the
problems of all previously used blood tests for AIDS.
One of the problems is the use of compositions to
bind AIDS antibody which contain proteins or
peptides which were not derived solely from the AIDS
etiologic agent. A composition using homogeneous
envelope AIDS protein of this invention overcomes
the nonspecificity of the prior tests or assays. Yet
another aspect of this invention is a diagnostic
method for detecting and/or determining the
presence of the antigen in human blood.

Another aspect of this invention is to use the
env AIDS proteins of the instant invention as
antigens suitable for providing protective immunity
against AIDS when incorporated into a vaccine.

Brief Description of the Drawings 



Fig. 1. The nucleotide sequence of the en- 
velope gene of the HTLV-III proviral genome 
(HXB-3). 



Fig. 2. Comparison of the amino acid
sequence of the env protein of the five
purported etiologic agents of AIDS. Amino acid
sequences are aligned to give maximum
homology.

Fig. 3. Construction of the pEV/env44-640
expression plasmids. The upper left panel
shows a simplified restriction site map of the
3.15 Kb EcoRI-XhoI segment of the HTLV-III
genome which contains the env coding
region (cross-hatched arrow). The right panel
shows the structure and pertinent sequences
of the pEV-vrf plasmids. The solid black
region represents the synthetic ribosome
binding site sequences upstream of the ATG
initiation codon (overlined). See Example 2
for a detailed description of the env
expression plasmid constructions.

Fig. 4. Western blot analysis of env coded
antigens produced in E. coli. Total bacterial
proteins were resolved by SDS-PAGE,
electro-blotted onto a nitrocellulose filter, and
env encoded proteins were detected by
reacting with human sera as described in
Example 5: a) negative control, cells containing
pJCL-E30 (p21T) induced at 42°C for 2
hours; b) uninduced control, cells containing
pEV3/env44-640 maintained at 30°C; c)
pEV3/env44-640; d) pEV1/env44-640; and e)
pEV3/env205-640 induced at 42°C for 2
hours.

Fig. 5. Recognition of bacterially synthesized
HTLV-III env gene products by antibodies in
AIDS patient sera. Bacterial lysates containing
recombinant env proteins were subjected
to Western blot analysis as described in
Example 5. Individual strips were then
incubated with a 1000-fold dilution of individual
sera followed by treatment with 125I-labeled
protein A. (Upper part) Serum samples were
from the following donors: (lane 1) normal
healthy donor; (lanes 2-18) AIDS patient sera
collected from the West Coast of the USA.
(Lower part) Serum samples were taken from
the following donors: (lane 1) donor found to
be HTLV-1(+) by ELISA using disrupted
virus; (lanes 4, 5, 11 and 15) healthy, normal
donors; (lanes 2, 3, 6, 8, 10, 12, 13, 14, 16,
17 and 18) AIDS patient sera from the East
Coast of the USA.

Fig. 6A. The amino acid sequence of the
AIDS envelope protein.

Fig. 6B. The amino acid distribution of the
AIDS envelope protein.

Fig. 7. Construction of the expression vector
pRC23. The Shine-Dalgarno sequence (SD)
is overlined and the location of the synthetic
ribosome binding site sequence in the plasmid
is represented by the solid black segment.
The plasmid contains the entire sequence
of pBR322 and thus confers resistance
to both ampicillin (amp R) and
tetracycline (tet R).

Fig. 8. Construction of the pEV-vrf vectors.
The synthetic oligonucleotides for each plasmid
which were placed downstream of the
SD sequence in pRC23 are shown with the
locations of the restriction enzyme cleavage
sites. The ATG initiation codon is overlined,
and the placement of the additional A-T base
pairs is designated by the rectangle. The
plasmids confer resistance to ampicillin only.
Detailed Description of the Invention

In the description the following terms are em- 
ployed: 

Nucleotide : A monomeric unit of DNA consisting
of a sugar moiety (pentose), a phosphate, and
either a purine or pyrimidine base (nitrogenous
heterocyclic). The base is linked to the sugar
moiety via the glycosidic carbon (1' carbon of the
pentose). That combination of a base and a sugar
is called a nucleoside. Each nucleotide is
characterized by its base. The four DNA bases are
adenine ("A"), guanine ("G"), cytosine ("C") and
thymine ("T").

DNA Sequence : A linear array of nucleotides
connected one to the other by phosphodiester
bonds between the 3' and 5' carbons of adjacent
pentoses.

Codon : A DNA sequence of three nucleotides
(a triplet) which encodes through mRNA an amino
acid, a translation start signal or a translation
termination signal. For example, the nucleotide triplets
TTA, TTG, CTT, CTC, CTA and CTG encode
the amino acid leucine ("Leu"). TAG, TAA and
TGA are translation stop signals and ATG is a
translation start signal.

Reading Frame : The grouping of codons during 
translation of mRNA into amino acid sequences. 
During translation the proper reading frame must 
be maintained. For example, the sequence 
GCTGGTTGTAAG may be translated in three read- 
ing frames or phases, each of which affords a 
different amino acid sequence: 

GCT GGT TGT AAG = Ala-Gly-Cys-Lys

G CTG GTT GTA AG = Leu-Val-Val 

GC TGG TTG TAA G = Trp-Leu-(STOP) 
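
The three readings above can be checked with a short Python sketch. It assumes the standard genetic code written on the DNA coding strand; the names `CODON_TABLE` and `translate_frame` are illustrative and do not appear in the specification.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate_frame(dna, frame):
    """Translate dna starting at offset frame (0, 1 or 2), stopping at a stop codon."""
    residues = []
    for i in range(frame, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            residues.append("(STOP)")
            break
        residues.append(THREE_LETTER[amino])
    return "-".join(residues)


if __name__ == "__main__":
    for frame in range(3):
        print(frame, translate_frame("GCTGGTTGTAAG", frame))
```

Trailing nucleotides that do not complete a triplet (the final AG and G in the second and third readings) are ignored, as in the listing above.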

Polypeptide: A linear array of amino acids
connected one to the other by peptide bonds between
the α-amino and carboxy groups of adjacent amino
acids.

Genome : The entire DNA of a cell or a virus. It 
includes inter alia the structural genes coding for 
the polypeptides of the substance, as well as oper- 
ator, promoter and ribosome binding and inter- 
action sequences, including sequences such as the 
Shine-Dalgarno sequences. 

Structural Gene : A DNA sequence which
encodes through its template or messenger RNA
("mRNA") a sequence of amino acids characteristic
of a specific polypeptide.

Transcription : The process of producing mRNA 
from a structural gene. 
Translation : The process of producing a
polypeptide from mRNA.

Expression : The process undergone by a struc- 
tural gene to produce a polypeptide. It is a com- 
bination of transcription and translation. 

Plasmid : A circular double-stranded DNA
molecule, not a part of the main chromosome of
an organism, containing genes that convey
resistance to specific antibiotics. When the plasmid is
placed within a unicellular organism, the
characteristics of that organism may be changed or
transformed as a result of the DNA of the plasmid. For
example, a plasmid carrying the gene for
tetracycline resistance (Tet R) transforms a cell
previously sensitive to tetracycline into one which is
resistant to it. A cell transformed by a plasmid is
called a "transformant".

Cloning Vehicle : A plasmid, phage DNA or
other DNA sequences which are able to replicate in 
a host cell, which are characterized by one or a 
small number of endonuclease recognition sites at 
which such DNA sequences may be cut in a deter- 
minable fashion without attendant loss of an essen- 
tial biological function of the DNA, e.g., replication, 
production of coat proteins or loss of promoter or 
binding sites, and which contain a marker suitable 
for use in the identification of transformed cells, 
e.g., tetracycline resistance or ampicillin resistance. 
A cloning vehicle is often called a vector. 

Cloning : The process of obtaining a population 
of organisms or DNA sequences derived from one 
such organism or sequence by asexual reproduc- 
tion. 

Recombinant DNA Molecule or Hybrid DNA : A 
molecule consisting of segments of DNA from dif- 
ferent genomes which have been joined end-to-end 
outside of living cells and have the capacity to 
infect some host cell and be maintained therein. 

The nomenclature used to define the peptides
or proteins is that used in accordance with
conventional representation such that the amino group at
the N-terminus appears to the left and the carboxyl
group at the C-terminus to the right. By natural
amino acid is meant one of the amino acids
commonly occurring in natural proteins comprising Gly,
Ala, Val, Leu, Ile, Ser, Thr, Lys, Arg, Asp, Asn, Glu,
Gln, Cys, Met, Phe, Tyr, Pro, Trp and His. By Nle
is meant norleucine, and by Nva is meant
norvaline. Where L and D forms are possible, it is the
L-form of the amino acid that is represented unless
otherwise expressly indicated. In addition, amino
acids have been designated by specific letters of
the alphabet such that: A = Alanine; B = Aspartic
Acid or Asparagine; C = Cysteine; D = Aspartic Acid;
E = Glutamic Acid; F = Phenylalanine; G = Glycine;
H = Histidine; I = Isoleucine; K = Lysine; L = Leucine;
M = Methionine; N = Asparagine; P = Proline;
Q = Glutamine; R = Arginine; S = Serine;
T = Threonine; V = Valine; W = Tryptophan;
Y = Tyrosine; Z = Glutamine or Glutamic Acid.
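
The single-letter designations just listed can be held in a lookup table. The following Python sketch is only a convenience for reading sequences such as that of Figure 6A; the names `ONE_LETTER` and `spell_out` are illustrative.

```python
# Single-letter amino acid designations as listed in the specification,
# including the ambiguity codes B and Z.
ONE_LETTER = {
    "A": "Alanine", "B": "Aspartic Acid or Asparagine", "C": "Cysteine",
    "D": "Aspartic Acid", "E": "Glutamic Acid", "F": "Phenylalanine",
    "G": "Glycine", "H": "Histidine", "I": "Isoleucine", "K": "Lysine",
    "L": "Leucine", "M": "Methionine", "N": "Asparagine", "P": "Proline",
    "Q": "Glutamine", "R": "Arginine", "S": "Serine", "T": "Threonine",
    "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "Z": "Glutamine or Glutamic Acid",
}


def spell_out(sequence):
    """Expand a one-letter sequence, written N-terminus first, into residue names."""
    return [ONE_LETTER[letter] for letter in sequence.upper()]


if __name__ == "__main__":
    print(spell_out("MKZ"))
```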

In accordance with the present invention, the 
search for the envelope protein of the etiologic 
agent for acquired immune deficiency syndrome 
(AIDS) has led to the isolation and sequencing of 
the proviral gene of the AIDS virus. It has now 
been discovered, for what is believed to be the first 
time that the postulated etiologic agents of AIDS, 
lymphadenopathy-associated virus (LAV), AIDS-as- 
sociated retrovirus (ARV) and human T-cell 
leukemia/Iymphoma/Iymphotropic virus (HTLV III) 
are in fact variants of the same virus. For purposes 
of this invention, in the specification and claims the 
virus causing AIDS will be referred to herein as 
AIDS virus. AIDS virus will be understood to in- 
clude the variants which have been postulated as 
the causative agents of AIDS, namely LAV, ARV 
and HTLV III. The envelope protein of the AIDS
virus (env AIDS) is a 97,200 dalton protein with 32
potential N-glycosylation sites. Nucleotide
sequence analysis of the AIDS envelope gene of the
putative etiologic agents of AIDS demonstrates that
all the viruses are variants of the same virus. That
means that there is approximately 1 to 20%
divergence or variation between the sequence of the
envelope gene of HTLV III and the sequences of the
envelope genes of the other viruses LAV and ARV-2.
The amino acid sequence of the env AIDS is set
forth in Figure 6(a). The amino acid distribution is
set forth in Figure 6(b).
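
Potential N-glycosylation sites are conventionally recognized as the tripeptide Asn-X-Ser or Asn-X-Thr, where X is any residue other than proline. The specification does not state which rule was used to arrive at 32 sites, so the Python sketch below is an assumption-laden illustration of the conventional rule, run on a made-up peptide rather than on the env AIDS sequence.

```python
import re

# Asn followed by any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons (e.g. N-N-T-T) each be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def n_glycosylation_sites(protein):
    """Return 1-based positions of Asn residues in N-X-S/T sequons (X not Pro)."""
    return [match.start() + 1 for match in SEQUON.finditer(protein)]


if __name__ == "__main__":
    toy_peptide = "MNGTANPSNKSNNTT"  # hypothetical sequence, for illustration only
    print(n_glycosylation_sites(toy_peptide))
```

Applied to the one-letter sequence of Figure 6A, the same function would list the candidate asparagines, although agreement with the stated count is not something this sketch can guarantee.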

The nucleotide sequence of the envelope gene
is shown in Figure 1. The proviral DNA sequence
was analyzed, using methods known to one of
ordinary skill in the art such as the chemical
degradation method of Maxam and Gilbert or the
M13 sequencing system of Messing, which is a
modification of the dideoxy nucleotide chain
termination method of Sanger, to determine the
location of the region coding for the envelope protein.
The location of an open reading frame, i.e. a long
stretch of triplet codons not interrupted by a
translational stop codon, for the envelope gene was
determined. The open reading frame coding for the
env gene encodes 863 amino acids and contains an
ATG codon at the eighth position from the 5' end of
the reading frame. The ATG codon is known to be a
universal translation-initiation codon.
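
The search for an open reading frame described here can be sketched in Python as follows. This is a minimal illustration on a made-up sequence, assuming only the forward strand is scanned; the function names are hypothetical and the specification does not say what software, if any, was used.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_open_frame(dna):
    """Return (frame, codons) for the longest stop-free run of codons on this strand."""
    best_frame, best_run = 0, []
    for frame in range(3):
        run = []
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if codon in STOP_CODONS:
                if len(run) > len(best_run):
                    best_frame, best_run = frame, run
                run = []
            else:
                run.append(codon)
        # A run may also end at the end of the sequence without a stop codon.
        if len(run) > len(best_run):
            best_frame, best_run = frame, run
    return best_frame, best_run


def first_atg_position(codons):
    """Return the 1-based codon position of the first ATG, or None if absent."""
    for position, codon in enumerate(codons, start=1):
        if codon == "ATG":
            return position
    return None


if __name__ == "__main__":
    toy = "GCTATGAATAACGGGTTTTAA"  # hypothetical sequence, for illustration only
    frame, codons = longest_open_frame(toy)
    print(frame, len(codons), first_atg_position(codons))
```

For the env gene, the analogous report would be an open frame of 863 codons with the first ATG at codon position 8.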

The integrated proviral genome of HTLV-III was
cloned from the genomic DNA of H9 cells infected
with HTLV-III [Shaw, G.M. et al., "Molecular
characterization of Human T-cell leukemia
(lymphotropic) virus type III in the acquired
immune deficiency syndrome", Science 226, 1165-1171
(1984)]. Since the HTLV-III provirus was
found to lack XbaI restriction sites, a genomic
library was constructed by using XbaI digested
H9/HTLV-III DNA. There are several methods
available to one of ordinary skill in the art for screening
the bacterial clones containing the AIDS env
protein cDNA. These include, for example, RNA
selection hybridization, differential hybridization with a
synthetic probe or screening for clones that
produce the desired protein by immunological or
biological assays. From the genomic library, colonies
of cells transformed with DNA that contains the
HTLV III sequences were selected by hybridization
screening of the library with HTLV III cDNA. The
DNA insert of the hybridization-positive clone,
HXB-3, was excised from the plasmid DNA and
sequenced.
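
The choice of XbaI rests on the provirus containing no XbaI recognition site, which presumably allows the integrated genome to survive digestion on a single fragment. A check of this kind can be sketched in Python; the recognition sequences are the well-known ones for these enzymes, while the test sequence and function names are hypothetical.

```python
# Recognition sequences (5' to 3') of enzymes named in the specification.
RECOGNITION_SITES = {
    "XbaI": "TCTAGA",
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
}


def find_sites(dna, enzyme):
    """Return 0-based start positions of every recognition site for enzyme in dna."""
    site = RECOGNITION_SITES[enzyme]
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start)
        start = dna.find(site, start + 1)
    return positions


def lacks_site(dna, enzyme):
    """True when dna would not be cut by enzyme."""
    return not find_sites(dna, enzyme)


if __name__ == "__main__":
    toy = "AAGAATTCTTTCTAGAGG"  # hypothetical sequence, for illustration only
    for enzyme in RECOGNITION_SITES:
        print(enzyme, find_sites(toy, enzyme))
```

Because these recognition sequences are palindromic, scanning one strand is sufficient.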

The predicted product of the env gene shares
many features in common with the envelope gene
products of other retroviruses. Thus, a hydrophobic
region is seen in the middle of the protein (amino
acids 519-534) which includes a processing site for
the cleavage of the precursor protein into exterior
and transmembrane proteins. Similarly, the amino
terminal end contains a short stretch of hydrophobic
amino acids (amino acids 17-37) which
constitutes a potential signal sequence. The HTLV-III
envelope precursor differs from the other retroviral
envelope protein precursors in that it contains an
additional stretch of 180 amino acids at the
carboxy terminus.
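
Hydrophobic stretches such as those noted above are commonly located with a sliding-window hydropathy average. The specification does not say how the regions were identified; the Python sketch below substitutes the widely used Kyte-Doolittle scale, and the window size, threshold and test peptide are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy index (higher values are more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window):
    """Mean hydropathy of each window-length stretch, N-terminus first."""
    scores = [KYTE_DOOLITTLE[residue] for residue in protein]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


def hydrophobic_windows(protein, window, threshold):
    """1-based start positions of windows whose mean hydropathy exceeds threshold."""
    profile = hydropathy_profile(protein, window)
    return [i + 1 for i, mean in enumerate(profile) if mean > threshold]


if __name__ == "__main__":
    toy = "KKKKLLLLLKKKK"  # hypothetical peptide, for illustration only
    print(hydrophobic_windows(toy, window=5, threshold=1.6))
```

On a full-length envelope sequence, a longer window would normally be chosen so that only sustained hydrophobic segments stand out.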

Polymorphism within the Envelope Region of AIDS 
Virus 

The recent publication of the nucleotide
sequences of LAV, ARV-2 and HTLV-III [Ratner, L., et
al., supra; Sanchez-Pescador, R., et al., supra;
Wain-Hobson, S., et al., supra] allows a detailed
comparison of these various isolates obtained from
AIDS patients from different parts of the world.
HTLV-III clones were isolated from AIDS patient
lymphocytes obtained from the east coast of the
United States, while LAV was isolated from a
French man and ARV was isolated from a patient in
California. A comparison of the sequence data
confirms the earlier observations made using
restriction enzyme site analysis which showed
approximately 10% variation. The present analysis shows
that the various isolates show the greatest amount
of conservation in the gag and pol regions while
the most divergence occurs in the env region. A
comparison of the five env sequences is presented
in Figure 2. With respect to the envelope gene,
HTLV-III and LAV are more closely related to each
other than to the ARV clone. Approximately 1.6%
divergence was observed between the HTLV-III
(HXB-3) and LAV sequences. Among the HTLV
sequences, the divergence was about 1.6%. However,
approximately 17% divergence was observed
between HTLV-III and ARV-2 and this was more
pronounced in the extracellular region of the
envelope gene product (Figure 2). This high rate of
divergence could be due to the geographical
location from where the two isolates were derived or
the time of isolation of these variants. ARV-2 was
isolated from the west coast of the United States
more recently. The HTLV-III isolates for which the
nucleotide sequences have been determined were
all obtained from the east coast of the United
States a year earlier. LAV was obtained from a
French patient who appears to have acquired the
virus in New York about the same period. The
observed differences in the sequence probably
reflect divergent evolution of strains separated in
time or geography or both. Within the env region,
the highest level of divergence is in the
extracellular portion of the protein.
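
A divergence figure of this kind can be computed from two aligned sequences as the percentage of compared positions that differ. The specification does not state how gaps were treated; the Python sketch below assumes that alignment columns containing a gap in either sequence are skipped, and the sequences shown are made up for illustration.

```python
def percent_divergence(seq_a, seq_b, gap="-"):
    """Percentage of mismatched positions between two aligned sequences.

    Columns with a gap in either sequence are skipped (an assumption;
    other conventions count each gap as a difference).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = mismatches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, for illustration only.
    print(percent_divergence("ACGTACGTAC", "ACGTACGTAA"))
    print(percent_divergence("ACG-TA", "ACGTTT"))
```

The same function works on aligned amino acid sequences such as those of Figure 2.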

Expression Vector 

A wide variety of host/cloning vehicle combina- 
tions may be employed in cloning the double- 
stranded DNA. For example, useful cloning ve- 
hicles may consist of segments of chromosomal, 
nonchromosomal and synthetic DNA sequences, 
such as various known bacterial plasmids, e.g. 
plasmids from E. coli such as pBR322, phage 
DNA, and vectors derived from combinations of 
plasmids and phage DNAs such as plasmids which 
have been modified to employ phage DNA or other 
expression control sequences or yeast plasmids. 
Useful hosts may include microorganisms, mam- 
malian cells, plant cells and the like. Among them 
microorganisms and mammalian cells are prefer- 
ably employed. As preferable microorganisms, 
there may be mentioned yeast and bacteria such 
as Escherichia coli, Bacillus subtilis, Bacillus 
stearothermophilus and Actinomyces. The above- 
mentioned vectors and hosts may also be em- 
ployed for the production of a protein from a gene 
obtained biologically as in the instant invention. Of 
course, not all host/vector combinations may be 
equally efficient. The particular selection of 
host/cloning vehicle combination may be made by 
those of skill in the art after due consideration of 
the principles set forth without departing from the 
scope of this invention. 

Furthermore, within each specific cloning
vehicle, various sites may be selected for insertion of
the double-stranded DNA. These sites are usually
designated by the restriction endonuclease which
cuts them. For example, in pBR322 the EcoRI site
is located just outside the gene coding for
ampicillin resistance. Various sites have been employed
by others in their recombinant synthetic schemes.
Several sites are well recognized by those of skill
in the art. It is, of course, to be understood that a
cloning vehicle useful in this invention need not
have a restriction endonuclease site for insertion of
the chosen DNA fragment. Instead, the vehicle
could be joined to the fragment by alternative
means.

The vector or cloning vehicle and in particular
the site chosen therein for attachment of a selected
DNA fragment to form a recombinant DNA
molecule is determined by a variety of factors, e.g.,
number of sites susceptible to a particular
restriction enzyme, size of the protein to be expressed,
susceptibility of the desired protein to proteolytic
degradation by host cell enzymes, contamination of
the protein to be expressed by host cell proteins
difficult to remove during purification, expression
characteristics, such as the location of start and
stop codons relative to the vector sequences, and
other factors recognized by those of skill in the art.
The choice of a vector and an insertion site for a
particular gene is determined by a balance of these
factors, not all selections being equally effective for
a given case.

There are several known methods of inserting
DNA sequences into cloning vehicles to form
recombinant DNA molecules which are equally useful
in this invention. These include, for example, direct
ligation, synthetic linkers, exonuclease and
polymerase-linked repair reactions followed by
ligation, or extension of the DNA strand with DNA
polymerase and an appropriate single stranded
template followed by ligation.

It should, of course, be understood that the
nucleotide sequences of the DNA fragment
inserted at the selected site of the cloning vehicle may
include nucleotides which are not part of the actual
structural gene for the desired polypeptide/protein
or may include only a fragment of the complete
structural gene for the desired protein. It is only
required that whatever DNA sequence is inserted, a
transformed host will produce a protein/peptide
having an immunological activity to the AIDS env
protein or that the DNA sequence itself is of use as
a hybridization probe to select clones which
contain DNA sequences useful in the production of
polypeptides/proteins having an immunological
activity to the AIDS env protein.

The cloning vehicle or vector containing the
foreign gene is employed to transform a host so as
to permit that host to express the protein or portion
thereof for which the hybrid DNA codes. The
selection of an appropriate host is also controlled by a
number of factors recognized by the art. These
include, for example, compatibility with the chosen
vector, toxicity of proteins encoded by the hybrid
plasmid, ease of recovery of the desired protein,
expression characteristics, biosafety and costs. A
balance of these factors must be struck with the
understanding that not all hosts may be equally
effective for expression of a particular recombinant
DNA molecule.

A preferred embodiment of the instant invention is to express
segments of the AIDS env protein in E. coli by inserting
restriction fragments isolated from the cloned proviral genome
into the versatile pEV-vrf (variable reading frame) expression
plasmids (for details of construction see Example 2). These
pEV-vrf plasmids are derivatives of pBR322 which contain the
phage lambda P L promoter, a synthetically derived
ribosome-binding site, and convenient cloning sites (EcoRI,
BamHI, ClaI and HindIII) just downstream of the initiation codon
(Figure 8). A set of three plasmids was constructed to
accommodate all three translational reading frames. The P L
promoter is regulated by a temperature-sensitive cI repressor
encoded on the compatible plasmid pRK248cIts [ATCC 33766;
Bernard, H.U. and Helinski, D.R., "The use of the λ phage
promoter P L to promote gene expression in hybrid plasmid
cloning vehicles", Meth. Enzymol. 68, 482-492 (1979)]. These
expression plasmids have been used to produce substantial
amounts of several heterologous proteins in E. coli including
v-bas p21 [Lacal, J.C. et al., "Expression of Normal and
Transforming H-ras genes in E. coli and purification of their
encoded p21 proteins", PNAS 81, 5305-5309 (1984)] and murine
interleukin-1 [Lomedico, P.T. et al., "Cloning and Expression of
Murine Interleukin-1 cDNA in E. coli", Nature 312, 458-462
(1984)].

In the present synthesis the preferred initial cloning vehicle
is the bacterial plasmid pBR322 (ATCC 37017) and the preferred
initial restriction endonuclease sites therein are the EcoRI and
HindIII sites (Figure 3). Insertion of proviral DNA contained
within the genome of H9 cells into these sites provides a large
number of bacterial clones each of which contains one of the
proviral DNA genes or fragments thereof present in the genome of
H9 cells. Only a very few of these clones will contain the gene
for env AIDS or fragments thereof.



The preferred host for initial cloning and expression of the env
AIDS gene in accordance with this invention is E. coli MC 1061
[Casadaban, M.J. and Cohen, S.N., "Analysis of Gene Control
Signals by DNA Fusion and Cloning in E. coli", J. Mol. Biol.,
138, 179-207 (1980)].

The coding sequences for amino acid residues 44 to 640 of the
env protein are located downstream of the P L promoter between
the KpnI and HindIII sites on the restriction map as shown in
Figure 3. Aside from the location of these convenient
restriction sites, these sequences were chosen for bacterial
expression experiments because they include neither the
amino-terminal signal peptide nor the hydrophobic transmembrane
segment at the carboxyl end. These sequences were excluded to
avoid possible toxicity problems which can occur when
hydrophobic proteins are over-produced in bacterial cells. In a
preferred embodiment of this invention an expression plasmid was
constructed that would direct the synthesis of this segment of
the env gene product (designated pEV/env 44-640). An
intermediate construction was first made by inserting a 2400 bp
EcoRI-HindIII fragment between the EcoRI and HindIII sites in
the pEV-vrf plasmids. The HTLV-III sequences (600 bp) between
the EcoRI and the KpnI site were then removed from the
intermediate construction as shown in Figure 3. These plasmid
constructions were carried out with all three pEV-vrf plasmids
so that subsequent deletions could be made and the correct
reading frame maintained. In addition, the constructions made in
the incorrect reading frames served as important controls in the
expression experiments described below.

In another embodiment of this invention, a second set of
expression plasmids was constructed in a similar fashion by
deleting sequences between the EcoRI and StuI sites which occur
483 bp downstream of the env gene. Again these deletions
(designated pEV/env 205-640) were made in all three reading
frames. The translation termination codon used in all of the env
expression plasmids is presumably an in-frame TAA located 23 bp
downstream of the HindIII site in the plasmid. Thus, 8 amino
acid residues at the carboxyl terminus are encoded by pBR322
sequences contained within the pEV-vrf expression plasmids.

Expression of ENV AIDS 

There are several approaches to screen for bacterial clones
containing env AIDS cDNA. These include, for example, RNA
selection hybridization, differential hybridization,
hybridization with a synthetic probe and screening for clones
that produce the desired protein by immunological or biological
assays. Two methods are available for screening by immunological
assay: bacterial colonies may be screened directly for the
presence of the protein using antibody; or, preferably, the
bacterial lysates are electrophoresed, blotted onto
nitrocellulose paper and then probed with the antibody.

In a preferred embodiment of this invention, cultures of the E.
coli strain MC 1061 transformed with pRK248cIts and the pEV1,
-2, or -3/env 44-640 (or pEV1, -2, or -3/env 205-640) were grown
in M9 medium at 30°C to mid-log phase and then induced by
shifting to 42°C for 2 hr. Samples of the bacterial cultures
were then taken and subjected to SDS-polyacrylamide gel
electrophoresis, followed by Western blot analysis to detect env
proteins. The protein blots were treated with antisera to env
AIDS proteins isolated either from immunized rabbits or from
AIDS patients previously shown to contain high titer antibodies
to AIDS antigens. This was followed by incubation with
125I-labelled Staphylococcus aureus protein A, washing and
autoradiography. Similar results were obtained with both sera
except that the human serum was found to contain much higher
titers of anti-HTLV-III antibodies and was devoid of all
background reactivity with the E. coli proteins. For this reason
human antibodies were used in all subsequent characterization.

Figure 4 shows the pattern of reactivity of the env AIDS
proteins synthesized in bacteria (recombinant proteins) with
anti-HTLV-III antibodies. The open reading frame in pEV3/env
44-640 encodes a protein that should migrate as a 68 Kd band on
the gel. In fact, a 68 Kd band is observed in the lane
corresponding to the induced cells containing pEV3/env 44-640
(lane c). However, in addition to the 68 Kd band, these cells
synthesized proteins of 35 Kd, 25 Kd and 17 Kd which
specifically cross-reacted with anti-HTLV-III antibodies. No
HTLV-III cross-reacting bands are evident in the uninduced
control (lane b) or in a second negative control sample (lane a)
of induced cells containing a plasmid that directs the synthesis
of the v-bas p21 oncogene product (Lacal, J.C. et al., supra).
The appearance of multiple bands synthesized from the env gene
sequences was an unexpected result. Another unexpected result
was the synthesis of env gene products from the plasmid
(pEV1/env 44-640) where the insert was placed in the wrong
reading frame with respect to the initiator codon immediately
downstream of the P L promoter (lane d). In this case, E. coli
cells containing plasmid pEV1/env 44-640 synthesized a 63 Kd
protein in addition to the 35 Kd, 25 Kd and 17 Kd proteins.
These results could be readily explained when the nucleotide
sequence of the envelope gene (Fig. 1) was examined. About 155
bases downstream of the KpnI site is an ATG codon which appeared
to be utilized for the synthesis of the env gene product by the
expression plasmid pEV1/env 44-640. Internal translation
initiation is also the likely explanation for the appearance of
the 35 Kd, 25 Kd and 17 Kd proteins. Initiation codons which are
preceded by so-called Shine-Dalgarno sequences (AGGA) are found
within the env coding region at locations that are consistent
with the sites of the observed protein products.
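
The search for internal initiation sites described above can be
sketched as a simple scan for ATG codons that follow an AGGA
motif. This is only an illustration: the function name, the
4 to 12 nucleotide spacing window and the toy sequence are
assumptions introduced here, not values taken from the
specification, and the real env sequence (Fig. 1) is not
reproduced.

```python
def find_internal_starts(seq, sd_motif="AGGA", min_gap=4, max_gap=12):
    """Return (motif_index, atg_index) pairs where an ATG follows the motif.

    The gap is counted from the end of the motif to the A of the ATG.
    The 4-12 nt window is an assumed, illustrative spacing.
    """
    seq = seq.upper()
    hits = []
    start = seq.find(sd_motif)
    while start != -1:
        motif_end = start + len(sd_motif)
        for gap in range(min_gap, max_gap + 1):
            atg = motif_end + gap
            if seq[atg:atg + 3] == "ATG":
                hits.append((start, atg))
        # Continue searching one position further so overlapping motifs are seen.
        start = seq.find(sd_motif, start + 1)
    return hits


# Hypothetical toy sequence: AGGA, a 6 nt spacer, then ATG.
toy = "CCC" + "AGGA" + "TTTTTT" + "ATG" + "GCACCC"
candidates = find_internal_starts(toy)
```

Each reported ATG position could then be compared with the size
of the observed product, as is done for the 35 Kd, 25 Kd and
17 Kd bands in the text.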

To confirm the above interpretation and to rule out the
possibility that the smaller proteins are formed as a result of
premature termination or from proteolytic cleavage of the larger
product, another deletion mutant was constructed in which
sequences between the KpnI and StuI sites were deleted. This
expression plasmid contains the coding sequences from amino acid
positions 205-640, which could code for a protein of 49 Kd.
Analysis of the proteins induced from E. coli harboring this
plasmid verified that, in fact, these cells synthesize a 49 Kd
protein in addition to the 35 Kd, 25 Kd and 17 Kd proteins (lane
e, Fig. 4). From these results, it was concluded that the
pEV3/env 44-640 expression plasmid directs the synthesis of a 68
Kd protein, and that the smaller polypeptides (i.e., 35 Kd, 25
Kd and 17 Kd) produced from all of the env expression plasmids
result from internal translation initiation within the env gene.
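
As a rough plausibility check on the band sizes quoted above, the
expected mass of each construct can be estimated from its residue
count. The sketch below assumes an average of about 0.110 Kd per
amino acid residue (a common rule of thumb, not a figure from this
specification) and counts only the 8 carboxyl-terminal residues
that the text attributes to pBR322 sequences; any vector-encoded
amino-terminal residues are ignored.

```python
AVG_RESIDUE_KD = 0.110  # assumed average mass per residue, in Kd


def estimated_kd(first_residue, last_residue, vector_tail=8):
    """Estimate the mass (Kd) of an env segment plus the pBR322-encoded tail."""
    n_residues = last_residue - first_residue + 1 + vector_tail
    return n_residues * AVG_RESIDUE_KD


env_44_640 = estimated_kd(44, 640)    # 597 env residues + 8 = 605 residues
env_205_640 = estimated_kd(205, 640)  # 436 env residues + 8 = 444 residues
```

Under these assumptions the estimates come out in the mid-60s and
high-40s of Kd respectively, in reasonable agreement with the 68 Kd
and 49 Kd bands reported, given that apparent sizes on SDS gels are
themselves approximate.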

Screening of AIDS SERA 

Because anti-HTLV-III antibodies are found in more than 90% of
AIDS patients, it was of interest to see if the bacterially
synthesized env gene products could be used as diagnostic tools
for the detection of these antibodies. For this analysis, total
cell protein from an induced bacterial culture was fractionated
by SDS-PAGE and transferred to a nitrocellulose filter by the
Western blotting technique. Strips of the filter containing
transferred proteins were reacted with 1000-fold diluted human
sera, and the antigen-antibody complexes formed were detected by
incubation of the strips with 125I-labelled Staphylococcus
aureus protein A followed by autoradiography. Prominent bands
corresponding to reaction of the antibody to the 68 Kd, 35 Kd,
25 Kd and 17 Kd proteins were consistently observed when the
serum used was from patients with AIDS. The results of such
assays with different human sera are presented in Figure 5. The
negative controls used were normal human sera and serum from a
patient with HTLV-I infection. No reaction was observed with
sera from healthy individuals or from HTLV-I infected
individuals. The patient sera were derived from all parts of the
United States including California, and all AIDS patients' sera
tested so far were found to be positive. The results suggest
that these antibodies are mainly directed against the protein
backbone of the molecule.

It appears, therefore, that the env gene products constitute the
best diagnostic reagents for the detection of AIDS-associated
antibodies. The env gene product of the instant invention
encompasses a large portion of the protein molecule and contains
both the conserved and divergent portions of the molecule. In
spite of the divergence observed between HTLV-III and ARV-2
sequences, the recombinant env proteins of the instant invention
synthesized by the bacteria react with AIDS patient sera derived
from both geographical locations of the United States. One
hundred percent (100%) of AIDS patient sera (50 individual
samples, 25 derived from the East Coast of the United States and
25 derived from California) tested showed high reactivity. This
is strong evidence for the presence of conserved epitopes within
the molecule against which the immune system could mount an
antibody reaction. The human immune system may thus be mounting
an immune response against conserved epitopes of the envelope
molecule, as suggested by the reactivity of the AIDS patient
sera. The observed divergence between various isolates of
HTLV-III thus may not pose a problem for the use of recombinant
protein as a vaccine. The 68 Kd protein is ideally suited for
such a purpose since it encompasses a large portion of the gene
product and has the unique structural feature of containing both
the extracellular hydrophilic region and the membrane-associated
hydrophobic regions. This structural feature makes it well
suited for encapsulation into liposomes, which have been used as
vehicles for vaccination against other viral envelope proteins.

Based on these discoveries it is proposed that in the practice
of screening blood for AIDS only AIDS envelope protein or a
variant of said protein be utilized. Utilizing the env AIDS
protein of the instant invention, human blood can be screened
for the presence of antibodies to the AIDS virus. This and other
techniques are readily determined once, as taught for the first
time by the present invention, the envelope AIDS protein has
been recognized to be the envelope protein of the etiologic
agent of AIDS. The foregoing and other objects, features and
advantages of the invention will be apparent from the following
examples of preferred embodiments of the invention.



Example 1 



Molecular cloning and nucleotide sequence analysis of the HTLV-III proviral genome 

The integrated proviral genome of HTLV-III was recently cloned
from the genomic DNA of H9 cells infected with HTLV-III [Shaw,
G.M. et al., supra]. The proviral genome, which was obtained by
using XbaI digested H9/HTLV-III DNA, contained two internal
EcoRI sites within the viral genome and two additional sites in
the cloning vector λ J1. These sites were used for further
subcloning of the three DNA fragments of 5.5 Kb, 4.5 Kb and 1.1
Kb into pBR322 (ATCC No. 37017). The nucleotide sequence of the
proviral genome was determined by the chemical degradation
method of Maxam, A.M. and Gilbert, W., "Sequencing end-labelled
DNA with base-specific chemical cleavages", Meth. Enzymol. 65,
499-560 (1980). For the sequence analysis, DNA inserts from the
three subclones were isolated by electroelution and further
cleaved with appropriate restriction enzymes. The DNA fragments
were labelled at their 5' ends with γ-32P-ATP using
polynucleotide kinase, or at their 3' ends with α-32P-NTP by
filling in with DNA polymerase I (Klenow fragment). The DNA
fragments labelled at the two ends were cleaved with a second
enzyme and the fragments labelled at a single end were purified
on 5% acrylamide gels and used for sequence analysis. For the
sequence analysis of the env gene, a shotgun approach was
utilized where the 4.5 Kb EcoRI fragment was cleaved with one of
the following enzymes: BglII, HindIII, XhoI, AvaII, HinfI and
Sau3A, and the restriction fragments were labelled and sequenced
as described above. The nucleotide sequence of the envelope gene
used in the present invention is shown in Figure 1.

Example 2 



Construction of pEV/env 44-640 

pRC2 is a derivative of pBR322 containing a unique BglII site
adjacent (on the amp R side) to the EcoRI site in the plasmid.
This plasmid was constructed in the following manner. 20 µg of
pBR322 plasmid DNA were digested with EcoRI and then split into
two reactions. In one, the protruding 5' single-stranded termini
were removed with S1 nuclease; in the other reaction, the
termini were filled in by incorporating deoxynucleotides with
the Klenow fragment of DNA polymerase I. Both reactions were
terminated by phenol extraction followed by ethanol
precipitation. Approximately 1 µg of DNA from each reaction was
mixed with 90 pmoles of phosphorylated BglII linkers (CAGATCTG,
purchased from Collaborative Research) and incubated with T4 DNA
ligase at 15°C for 18 hours. The ligation products were then
digested with BglII and PstI and subjected to gel
electrophoresis in 1% agarose. The 3600 bp and 760 bp fragments
from both reactions were recovered from the gel. For the
construction of pRC2, the 3600 bp fragment from the Klenow
reaction was ligated to the 760 bp fragment from the S1
reaction. To construct a plasmid with the BglII site on the
other side of EcoRI (tet R side), designated pRC1, the 3600 bp
fragment from the S1 reaction was ligated to the 760 bp fragment
from the Klenow reaction. E. coli strain RR1 (ATCC No. 31343)
was transformed with the ligation mixtures, and transformants
were selected on LB agar plates containing 50 µg/ml ampicillin.
Transformants containing the expected plasmid constructions were
identified by restriction analysis of the isolated plasmid DNA.
DNA sequence analysis confirmed that the S1 nuclease treatment
precisely removed the 5' single-stranded termini.
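
The linker ligation above combines about 1 µg of EcoRI-cut pBR322
with 90 pmoles of BglII linkers. A short calculation shows how
large the linker excess over plasmid ends is. The sketch assumes an
average mass of 650 g/mol per base pair of double-stranded DNA and
the commonly cited pBR322 length of 4361 bp; neither figure is
stated in the specification, and the function name is illustrative.

```python
def pmol_dsdna(mass_ug, length_bp, g_per_mol_per_bp=650.0):
    """Convert a mass of double-stranded DNA (in µg) to picomoles of molecules."""
    # µg -> g is 1e-6, mol -> pmol is 1e12, giving a net factor of 1e6.
    return mass_ug * 1e6 / (length_bp * g_per_mol_per_bp)


PBR322_BP = 4361          # assumed plasmid length
plasmid_pmol = pmol_dsdna(1.0, PBR322_BP)
end_pmol = 2 * plasmid_pmol          # a linearized plasmid has two ends
linker_excess = 90.0 / end_pmol      # linkers per plasmid end
```

Under these assumptions there is well under 1 pmole of plasmid ends
in the reaction, so the linkers are present in an excess of more
than a hundredfold, which favors linker addition over
recircularization of the vector.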

pRC23 (see Figure 7) was constructed by inserting into pRC2 a
250 bp BglII-HaeIII fragment containing the λ P L promoter
joined to a pair of complementary synthetic oligonucleotides
comprising a model ribosome-binding site (RBS). The HaeIII site
is located within the 5' non-coding region of the λ N gene 115
bp downstream of the P L transcriptional initiation site.
Approximately 1 µg of a 450 bp BglII-HpaI fragment isolated from
phage λ DNA was digested with HaeIII. 200 ng of the resulting
digestion products were mixed with 60 pmoles each of
phosphorylated synthetic oligonucleotides containing the model
RBS. The ligated molecules were digested with BglII and EcoRI
and separated on a 5% polyacrylamide gel. The 270 bp ligation
product was recovered from the gel, mixed with gel purified pRC2
vector that had been digested with BglII and EcoRI, and
incubated with T4 DNA ligase at 15°C for 15 hours. The ligation
mixture was used to transform strain RR1 (pRK248cIts).
Transformants selected on ampicillin-containing medium were
screened by restriction analysis of the isolated plasmid DNA.
The expected plasmid construction, pRC23, was confirmed by
further restriction enzyme digestions and by DNA sequence
analysis across the EcoRI junction (Fig. 7).

For the construction of the pEV-vrf set of plasmids (see Figure
8), plasmid pRC23 was digested with EcoRI and HindIII and the
pRC23/EcoRI-HindIII vector isolated by preparative agarose gel
electrophoresis. The mixture of synthetic oligonucleotides (32,
33, and 34 nucleotides) was combined with the mixture of the
complementary sequences, heated to 58°C for 5 minutes in 150 mM
NaCl, and cooled slowly to allow annealing. 0.1 pmoles of the
synthetic duplexes were added to 0.07 pmoles of the
pRC23/EcoRI-HindIII vector and incubated with T4 DNA ligase at
15°C for 15 hours. Strain RR1 (λ cI857) was transformed with the
ligation products. Six ampicillin resistant transformants were
selected for DNA sequence analysis. Of the six, two contained
the expected sequence for pEV-vrf1, one for pEV-vrf2, and three
for pEV-vrf3 (Fig. 3).
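
The three oligonucleotide lengths used above (32, 33 and 34
nucleotides) differ by single nucleotides, which is what lets the
set of plasmids cover every reading frame. The sketch below simply
takes each length modulo 3. It assumes that the length differences
fall between the initiation codon and the cloning sites, and it does
not say which length corresponds to which pEV-vrf plasmid, since the
text does not give that mapping here.

```python
# Lengths of the synthetic oligonucleotides named in the text.
oligo_lengths = [32, 33, 34]

# The remainder modulo 3 is the frame offset each length introduces
# relative to a fixed upstream initiation codon (assumed geometry).
frame_offsets = {length: length % 3 for length in oligo_lengths}

# True when the three lengths between them reach all three frames.
covers_all_frames = sorted(frame_offsets.values()) == [0, 1, 2]
```

Any three consecutive lengths give the three distinct remainders,
so an insert with an arbitrary phase can be placed in frame in
exactly one member of the set.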

For the expression of the AIDS env gene, one µg of a 2400 bp
EcoRI-HindIII DNA fragment, which was isolated from the cloned
HTLV-III proviral genome by preparative agarose gel
electrophoresis, was mixed with 0.1 µg of EcoRI-HindIII digested
vector DNA (pEV-vrf1, -2, or -3). After heating at 65°C for 3
minutes, the mixtures were chilled on ice, and 20 µl ligation
reactions were assembled, containing 50 mM Tris-HCl (pH 7.4), 10
mM MgCl2, 10 mM DTT, 0.3 mM ATP, and 200 units of T4 DNA ligase.
After incubation at 15°C for 4 hours, the reactions were
terminated by heating at 65°C for 5 minutes. The ligation
products were used to transform E. coli strain MC1061 containing
plasmid pRK248cIts. Transformants were selected on Luria broth
agar containing 50 µg/ml ampicillin at 30°C for 18 hours.
Plasmid DNA was isolated from 1 ml of each culture and subjected
to restriction analysis. All 12 isolates contained the expected
plasmid construction. These intermediate constructions were then
used to make pEV1, -2, and -3/env 44-640 by deleting the 600 bp
between the EcoRI and KpnI sites as described below.

Approximately 0.5 µg of plasmid DNA was digested with KpnI and
EcoRI. The resulting termini were then treated with the Klenow
fragment of DNA polymerase I in the presence of all four
deoxyribonucleotides (at 100 µM) at 37°C for 30 minutes. This
step results in the "filling-in" of the 5' overhang of the EcoRI
terminus and the removal of the 3' overhang of the KpnI
terminus. Upon recircularization of the linear plasmid and
blunt-end ligation of these termini, an EcoRI site is
regenerated. Transformants containing plasmids with the expected
deletion were identified by restriction analysis.

A second set of deletion derivatives, designated pEV/env
205-640, was constructed in a similar fashion. A portion of the
linear plasmid that had been digested with EcoRI and KpnI and
treated with Klenow, as described above, was further digested
with StuI. Again, upon recircularization and blunt-end ligation,
the EcoRI site was regenerated; however, an additional 483 bp of
env coding sequences were removed.

Example 3 



Bacterial Growth and Induction of env Gene Expression 

Cultures of E. coli strain MC 1061 transformed with plasmid
pRK248cIts and the pEV1, -2, or -3/env plasmids were grown in M9
medium containing 0.5% glucose and 0.5% casamino acids at 30°C
to mid-log phase and then induced by shifting to 42°C for 2 hr.
The cells were collected by centrifugation and processed as
described in Examples 4 and 5.

Example 4 



Expression and Purification of Env AIDS 

A homogeneous recombinant viral env AIDS 
was purified according to the following procedure. 
The env AIDS protein expressed by a microbe 
tends to associate with the membrane fractions of 
the host microbe, principally the inner membrane 
of the microbe. The following purification method 
was designed to deal with this finding. 

This purification method comprises: 

(a) lysis of transformed microbial cells producing
recombinant env AIDS protein;

(b) separation of env AIDS associated cellular membranes
from other cellular components;

(c) extraction of env AIDS from associated membranes; and

(d) chromatographic purification of the resultant
extraction solution containing env AIDS to yield a
substantially pure recombinant viral env protein.

More specifically, the preferred purification 
method for the preparation of substantially pure 
recombinant viral env protein comprises: 

(a) cultivating a transformed organism containing a DNA
sequence which codes for viral env protein;

(b) causing a culture of the transformed organism of step
(a) to accumulate the env protein;

(c) lysing the culture of transformed organisms of step (b)
to form a cell lysate mixture;

(d) isolating the cell membrane components of the cell
lysate mixture of step (c);

(e) washing the isolated cell membrane components with an
extraction solution to yield a wash solution containing env
protein; and

(f) chromatographically purifying the wash solution of step
(e) to yield a substantially pure env AIDS protein.

In carrying out this method it is preferred that the cells be
lysed by sonication, although it is foreseeable that other known
methods such as enzyme or mechanical lysis could also be used.
It is preferred that the cell membrane components, specifically
the inner and outer membranes, be isolated from other cellular
components by methods such as centrifugation. It has been found
that env AIDS expressed by the transformed microorganism tends
to become associated with the cellular membranes. Therefore,
isolation of these membranes during the purification process
ensures high purification levels and high purity env AIDS at the
end of the purification procedure.

Once the cell membranes are isolated from the lysate mixture,
they are washed with an extraction solution, preferably salt
solutions and a detergent, to yield a second solution containing
approximately 50% env AIDS protein. Preferably the cell
membranes are washed in four separate steps with the salt
solutions and detergent, although it is foreseeable that certain
of these steps could be combined, rearranged or eliminated. The
first step of washing the cell membrane may be done with a salt
solution, preferably 1M NaCl. In the second step the cell
membrane is washed with a detergent solution, preferably 1%
Triton X-100. In the third step, the cell membrane is washed
with another salt solution, 1.75M to 3.5M guanidine HCl. The
final wash is also with a salt solution, preferably about 7M
guanidine HCl. The wash solution which results from the fourth
and final wash comprises about 50% env AIDS.

The final 50% env AIDS wash solution is then further purified by
a chromatography step, preferably reverse phase high performance
liquid chromatography (HPLC). The HPLC step yields env AIDS
protein in a substantially 100% pure form. It is also
foreseeable that antibody affinity chromatography columns
utilizing polyclonal or monoclonal antibodies to env AIDS could
be used as an alternative to HPLC.

Example 5 



Polyacrylamide gel electrophoresis and Western blot analysis 

Cells were lysed by resuspending the cell pellets (approximately
10 1 cells) in TG buffer (10 mM Tris, pH 7.4, 10% glycerol),
mixed with an equal volume of 2x sample buffer [Laemmli, U.K.,
"Cleavage of Structural Proteins During the Assembly of the Head
of Bacteriophage T4", Nature 227, 680-685 (1970)] and incubated
at 95°C for five (5) minutes. Cell debris was pelleted by
centrifugation and the cleared lysates were subjected to
SDS-PAGE analysis [Laemmli, U.K., supra]. For Western blot
analysis, the proteins from the acrylamide gel were
electroblotted onto a 0.1 µm nitrocellulose membrane (Schleicher
and Schuell) for 16 hr at 50V, in 12.5 mM Tris, 96 mM glycine,
20% methanol, 0.01% SDS at pH 7.5. Processing of the blot was
carried out using the methods described by Towbin, H. et al.
["Electrophoretic Transfer of Proteins From Polyacrylamide Gels
to Nitrocellulose Sheets: Procedure and Some Applications",
Proc. Natl. Acad. Sci. U.S.A., 76, 4350-4354 (1979)]. For
treatment with the human sera, the blots were incubated with a
1000-fold dilution of the sera in antibody buffer (20 mM sodium
phosphate buffer, pH 7.5, containing 0.5 M NaCl, 1% BSA and
0.05% Tween 20) for 2-6 hr. The blots were then washed twice
with phosphate buffered saline containing 0.05% Tween 20 and
then incubated with 125I-labelled Staphylococcus aureus protein
A for an additional period of 1 hr. The blot was then washed
twice in PBS-Tween 20 buffer, dried and autoradiographed.

Example 6 



Immunization with Env Protein of AIDS Virus 

It is clear that in spite of the divergence observed between
HTLV-III and ARV-2 sequences, the recombinant proteins
synthesized by the bacteria react well with AIDS patients' sera
derived from both geographical locations of the United States.
One hundred percent (100%) of the AIDS patients' sera tested
showed high reactivity (50 individual samples, 25 from the east
coast of the United States and 25 from the west coast of the
United States). Thus all the env proteins contain at least one
conserved epitope. All of the human sera from AIDS patients
tested contained antibodies to the env proteins of the instant
invention. This strongly suggests that these env proteins with
the conserved epitopes would be immunogenic in man.

It will be readily appreciated that the env pro- 
teins of the instant invention can be incorporated 
into vaccines capable of inducing protective immu- 
nity against the AIDS virus. By methods known in 
the art, the specific amino acids comprising the 
epitopes of the env protein may be determined. 
Peptides may then be synthesized, comprising an 
amino acid sequence corresponding to an epitope 
of an env AIDS protein either in monomeric or 
multimeric form. These synthetic peptides may 
then be incorporated into vaccines capable of in- 
ducing protective immunity against AIDS virus. 
Techniques for enhancing the antigenicity of such 
peptides include incorporation into a multimeric 
structure, binding to a highly immunogenic protein 
carrier, for example, keyhole limpet hemocyanin, or 
diphtheria toxoid, and administration in combination 
with adjuvants or any other enhancers of immune 
response. In addition, the vaccine composition may 
comprise antigens to provide immunity against oth- 
er diseases in addition to AIDS. 

An amino acid sequence corresponding to an 
epitope of an env protein either in monomeric or 
multimeric form (peptide) may be obtained by 
chemical synthetic means or by purification from 
biological sources including genetically modified 
microorganisms or their culture media. The peptide 
may be combined in an amino acid sequence with 
other peptides including fragments of other pro- 
teins, as for example, when synthesized as a fusion 
protein, or linked to other antigenic or non-antigenic 
peptides of synthetic or biological origin. The term 
"corresponding to an epitope of an env protein" will 
be understood to include the practical possibility 
that, in some instances, amino acid sequence vari- 
ations of a naturally occurring peptide may be 
antigenic and confer protective immunity against 
AIDS infection. Possible sequence variations in- 
clude, without limitation, amino acid substitutions, 
extensions, deletions, interpolations and combina- 
tions thereof. Such variations fall within the con- 
templated scope of the invention provided the pep- 
tide containing them is antigenic and antibodies 
elicited by such peptide cross-react with naturally 
occurring env protein or non-variant repeated pep- 
tides of env protein, to an extent sufficient to pro- 
vide protective immunity when administered as a 
vaccine. Such vaccine compositions will be combined
with a physiologically acceptable medium.
The size and shape of epitopes found in carbohydrate
antigens have been extensively studied, but
less is known about the structure of epitopes from 
protein molecules. Some epitopes of protein anti- 
gens have been defined at the level of their tertiary 
structure. In every instance, the epitopes were 
formed not by the primary sequences alone, but by 
the juxtaposition of residues brought together by 
the folding of the polypeptide chain(s) of the native 
molecule. In addition, the structure of the 68Kd env 
protein of the instant invention makes it particularly 
well suited for use as a vaccine. The 68Kd env 
protein comprises a large portion of the gene prod- 
uct which (a) was shown to be reactive with all the 
AIDS sera tested; and (b) has the unique structural 
feature of containing both an extracellular hydro- 
philic region and the transmembrane hydrophobic 
regions. The latter structural feature makes it well 
suited for use as a vaccine using liposome encap- 
sulation to create a vehicle for administration. 

Routes of administration, antigen dose, number 
and frequency of injections are all matters of
optimization within the scope of ordinary skill in the
art, particularly in view of the fact that there is 
experience in the art in providing protective immu- 
nity by the injection of other related antigens to 
provide immunity in other viral infections. It is an- 
ticipated that the principal value of providing immu- 
nity to AIDS infection will be for those individuals 
who have had no previous exposure to AIDS, e.g., 
individuals who are in the high risk population, such 
as homosexuals, drug addicts and people from 
Haiti and Central Africa, and individuals who may 
be receiving blood transfusions. It is also antici- 
pated that temporary immunity for infants may be 
provided by immunization of mothers during preg- 
nancy. 

Example 7 



Diagnostic Test for AIDS 

It is clear that the env gene proteins of the
instant invention may be used as diagnostic
reagents for the detection of AIDS-associated
antibodies. It is also apparent to one of ordinary skill
that a diagnostic assay for AIDS using polyclonal or
monoclonal antibodies to the AIDS env protein of
the instant invention may be used to detect the
presence of the AIDS virus in human blood. In one
embodiment a competition immunoassay is used
where the antigenic substance, in this case the
AIDS virus, in a blood sample competes with a
known quantity of labelled antigen, in this case
labelled AIDS env protein, for a limited quantity of
antibody binding sites. Thus, the amount of
labelled antigen bound to the antibody is inversely
proportional to the amount of antigen in the
sample. In another embodiment, an immunometric
assay may be used wherein a labelled AIDS env
antibody is used. In such an assay, the amount of
labelled antibody which complexes with the
antigen-bound antibody is directly proportional to
the amount of antigen (AIDS virus) in the blood
sample. In a simple yes/no assay to determine
whether the AIDS virus is present in blood, the
solid support is tested to detect the presence of
labelled antibody. In another embodiment
monoclonal antibodies to AIDS env protein may be
used in an immunometric assay. Such monoclonal
antibodies may be obtained by methods well
known in the art, particularly the process of Milstein
and Kohler reported in Nature 256, 495-497 (1975).

The immunometric assay method is as follows:
Duplicate samples are run in which 100 µl of a
suspension of antibody immobilized on agarose
particles is mixed with 100 µl of serum and 100 µl
of soluble 125I-labelled antibody. This mixture is
incubated for specified times ranging from one
quarter hour to twenty four hours. Following the
incubation periods the agarose particles are washed
by addition of buffer and then centrifuged. After
removal of the washing liquid by aspiration, the
resulting pellet of agarose particles is then counted
for bound 125I-labelled antibody. The counts
obtained for each of the complexes can then be
compared to controls.
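
The comparison of counts to controls can be sketched as follows. This is a minimal illustration only: the function name `classify_sample`, the example counts and the cutoff ratio of 2.0 are hypothetical and are not taken from this specification, which does not state how a cutoff is chosen.

```python
from statistics import mean


def classify_sample(sample_counts, negative_control_counts, cutoff_ratio=2.0):
    """Compare duplicate sample counts with negative-control counts.

    In an immunometric assay the bound label rises with the amount of
    antigen, so a sample is flagged when its mean count reaches the
    (assumed, hypothetical) multiple of the negative-control mean.
    """
    sample_mean = mean(sample_counts)
    control_mean = mean(negative_control_counts)
    ratio = sample_mean / control_mean
    return ratio, ratio >= cutoff_ratio


# Hypothetical duplicate counts (counts per minute) for illustration only.
ratio, flagged = classify_sample([5200, 4800], [1000, 1000])
print(f"sample/control ratio = {ratio:.1f}, flagged = {flagged}")
```

In practice the cutoff would have to be established empirically from panels of known positive and negative sera; the value used here is only a placeholder.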

While the invention has been described in
terms of certain preferred embodiments,
modifications obvious to one with ordinary skill in the art
may be made without departing from the scope of
the invention. For example, it is understood that the
env AIDS DNAs described herein represent only
the precise structure of two naturally occurring
gene segments. It is expected that slightly modified
alleles will be found encoding similarly functioning
proteins, and such gene segments and proteins
are considered to be equivalents for the purpose of
this invention. It is also suspected that other
variants in addition to those described herein will be
found and that the envelope protein of said variants
will differ slightly. These variant envelope proteins
are likewise considered within the scope of the
invention. DNA having equivalent codons is
considered within the scope of the invention, as are
synthetic gene segments that encode homologous
proteins of the viral envelope.
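
The notion of equivalent codons can be illustrated with a short sketch, assuming the standard genetic code. The helper `translate` and the second, synonymous DNA string are hypothetical and serve only as an illustration; the first string is the opening fifteen nucleotides of the sequence recited in claim 10, which corresponds to the opening residues ValTrpLysGluAla recited in claim 3.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Three-letter residue names in the notation used by the claims (MET in capitals).
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "MET", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
    "*": "End",
}


def translate(dna):
    """Translate a DNA string, reading complete codons from position 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(THREE_LETTER[CODON_TABLE[c]] for c in codons)


claimed = "GTGTGGAAGGAAGCA"     # start of the sequence in claim 10
synonymous = "GTTTGGAAAGAGGCT"  # hypothetical variant using equivalent codons
print(translate(claimed))
print(translate(synonymous))
print(translate(claimed) == translate(synonymous))
```

Both strings encode the same five residues although they differ at four nucleotide positions, which is the sense in which DNA having equivalent codons encodes the same protein.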

Various features of the invention are set forth in 
the following claims. 

Claims



1. An envelope protein of an acquired immune 
deficiency syndrome (AIDS) virus comprising the 
amino acid sequence: 



ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu

METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr

HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn

METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr

AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr

SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr

AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn

AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly

AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer

LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu

ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle

SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn

AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys

TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg

GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly

AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn

LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln

AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly

LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp

AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle

GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu

TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu

ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu

SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly

GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg

SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu

LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu

LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle

GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu



or fragments thereof. 


2. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 
METArg
ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu

3. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 
ValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

4. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:



PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
5. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence: 



METArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGlriSerSerGlyGlyAspPrcjGlu 
IleValThrHisSerPheAsnCysGlyGlyGlu^ 

ThrTrpPheAsnSerThrTrpSerThrGlu^^ 

ProCysArglleLysGlnPhelleAsnMETO^ 

SerGlyGlnlleArgCysSerSerAsnlleTh^ 

AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLyslleGLuProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GLuLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 
... AlaAlaSerMETThrLeuThrValGlnAlaAr^ 

LeuLeuArgAlaIleX3luAlaGlnGlruHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGLn 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
: LysLeuIleCysThrThrAlaValProTrpAsnALaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluILeAsnAsnTyrThrSer 



6. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 



AsnGlySerGluIlePheArgProGlyGlyGlyAspHETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLyslleGluProLeuGLyValALaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGl^ 

LeuLeuArgAlalleGluJVlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 
AlaArglleLeuAlaVaLGluAro^yrLeuLysAspGlnGinLeuLeuGlylleTrpGlyCysSerGly 
LysLeuILeCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsntyrThrSer 



AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlr^ 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGLylleLysGlnLeuGLn 
AlaArglleLeuAlaVaLGluArgTyrLeuLysAspGlnGlnLeuLeu^lylleTrpGLyCysSe 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHtsThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 
7. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence: 







8. An envelope protein as claimed in any one of 
claims 1 to 7 that is a homogeneous protein free of 
other AIDS viral proteins. 






9. An expression vector comprising a gene coding 
for an envelope protein of an AIDS virus down- 
stream of a promoter sequence enabling transcrip- 
tion, translation and thus expression of said en- 
velope protein in a host cell. 
10. An expression vector according to claim 9
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 


GTGTGGAAGGAAGCA 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 


or an equivalent thereof. 

11. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the
nucleotide sequence: 



TGTCCAAAGGTATCC

TTTGAGCCAATTCCCATACATTATTGTGCC^ 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAG^^^ 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGM^ 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGT^ 

AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAC^ 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCOT^ 

ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAAC^ 

ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAA 

AGCGGACAAATTAGATCTTCATCAMTATTACAGGGCTGCTATTAACAAGAGA 

AATGGGTCCGAGATCTTCAGACCTWAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGC^ 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT^ 

GCAGCGTCAATGACGCTGACGGTACAGGCCAG 

TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACACT 

GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AJUVCTAATTTGCACCACTGCTGTGCCTTGGAATC 

AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 
or an equivalent thereof. 

12. An expression vector according to claim 9 
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the 
nucleotide sequence: 

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTA 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATC^ 
ATTGTAAOTCACAGTTTTAATTGTGGAGGGGAATTTTTCT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTAT^ 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTTC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTOT 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCT^ 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGC^ 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTC 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACA 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAX^ 
AAACTAATTTGCACCACTGCTGTGCCTIKM^ 

AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof.

13. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence:

ATGTATGCCCCTCCCATC

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATCXKJTCCGAGATCTTCAGACCTGGAGGAGGAGATATGACKMACAATTGGAGAAGTGAATTATAT^ 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 

gaaaaaagagcagtgggaataggagctttgttccttgggttcttgggagcagcaggaagcactatc 

gcagcgtcaatgacgctgacggtacaggccagacaattattgtctggtatagtgcagcagcagaacaat 
ttgctgagggctattgaggcgcaacacx:atctgttgc^ 

GCAAGAATCCTGGCTGTGGAMGATACCTAAAGGATCAACAGCTCCTCKXJGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof. 

14. An expression vector according to claim 9 
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the 
nucleotide sequence: 

atgagggacaattggagaagtgaattatataaa 
tataaagtagtaaaaattgaaccattaggagtagcacccaccaackk:aaagagaagagtggtgcagaga 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



15. An expression vector according to any one of
claims 9 to 14 which is a plasmid capable of
replication in gram-negative and/or gram-positive
bacteria.

16. An expression vector according to claim 15
which is capable of replication in an E. coli strain.

17. An expression vector according to claim 15
which is capable of replication in a B. subtilis
strain.

18. An expression vector according to claim 15 or
16 which is a member of the pEV/env family.

19. An expression vector according to claim 18
which is pEV1, -2, or -3/env 44-640.

20. An expression vector according to claim 18
which is pEV1, -2, or -3/env 205-640.

21. A transformant carrying an expression vector as
claimed in any one of claims 9 to 20.

22. A transformant according to claim 21 which is
an E. coli strain.

23. A transformant according to claim 22 which is
an E. coli MC 1061 strain.

24. A transformant according to claim 21 which is a
B. subtilis strain.

25. A transformant according to claim 21 which is a
eucaryotic cell.

26. A method of producing an envelope protein of
an acquired immune deficiency syndrome virus as
claimed in any one of claims 1 to 8 comprising:

transforming a host cell with an expression vector
as claimed in any one of claims 9 to 20;

culturing said host cell so that said AIDS env
protein is expressed; and

extracting and isolating said AIDS env protein.

27. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 44-640.

28. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 205-640.

29. A method of testing human blood for the
presence of antibodies to the viral etiologic agent of
AIDS which comprises mixing a composition
containing an envelope protein of an AIDS virus as
claimed in any one of claims 1 to 8 with a sample
of human blood and determining whether said
envelope AIDS protein binds to AIDS antibodies
present in the blood sample.

30. A method according to claim 29 which com- 
prises the use of the Western Blotting Analysis. 

31. A method according to claim 29 which com- 
prises the use of an ELISA-technique, wherein an 
envelope protein of an AIDS virus as claimed in 
any one of claims 1 to 8 is coated on a solid phase 
and contacted with the sample and after washing 
contacted with an enzyme-labeled non-human IgG. 

32. A method according to claim 29, wherein the 
Double-Antigen-Method is used. 

33. A method for the determination of AIDS virus, 
wherein antibodies against an envelope protein of 
an AIDS virus according to any one of claims 1 to 
8 are used. 

34. A method according to claim 33, wherein the 
antigen in the sample and a protein as claimed in 
any one of claims 1 to 8 in labeled form compete 
with an antibody against a protein as claimed in 
any one of claims 1 to 8. 

35. A method according to claim 33, wherein a 
sandwich method is performed using two anti- 
bodies against a protein as claimed in any one of 
claims 1 to 8. 

36. A method according to claim 35, wherein one 
antibody is on a solid phase and the other antibody 
is labeled. 

37. A method according to claim 35, wherein two 
different monoclonal antibodies are used. 



38. A vaccine eliciting immunity to AIDS compris- 
ing as an active ingredient a protein as claimed in 
any one of claims 1 to 8. 

39. Antibodies raised against a protein as claimed
in any one of claims 1 to 8. 

40. The antibodies of claim 39 which are mon- 
oclonal antibodies. 


41. The use of a protein as claimed in any one of
claims 1-8 for the preparation of a protective im- 
munisation vaccine. 

42. The use of a protein as claimed in any one of
claims 1 to 8 for testing human blood for the 
presence of AIDS virus. 

Claims for Austria: 

1. A process for the preparation of an envelope
protein of an acquired immune deficiency syndrome
(AIDS) virus comprising:

transforming a host cell with an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus downstream of a promoter sequence
enabling transcription, translation and expression
of said envelope protein in said host cell;

culturing said host cell so that said envelope protein
of an AIDS virus is expressed; and

extracting and isolating said envelope protein of an
AIDS virus.

2. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu

METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr 

HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn 

METTrpLysAsnAspMETValGluGlnMETHisGluAspIlelleSerLeuTrpAspGlnSerLeuLys 

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 

AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnlleSerThr 

SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspllelleProIleAspAsn 

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysValSer 

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylleArgProValVal 

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValVallleArgSerValAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn 

AsnThrArgLysLysIleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLysIleGLy 

AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGinSerSerGlyGlyAspProGlu 

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 

ProCysArglleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsriAsn 

AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGiuLeuTyrLys 

TyrLysValValLyslleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGiylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 

AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGIn 

AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 

LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle 

GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu 

TrpAsnTrpPheAsnlleThrAsnTrpLeuTrpTyrlleLysLeuPhelleMETIleValGlyGlyLeu 

ValGlyLeuArglleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu 

SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlylleGluGluGluGlyGly 

GluArgAspArgAspArgSerlleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg 

SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArglleValGluLeu 

LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu 

LysAsnSerAlaValSerLeuLeuAsnAlaThrAlalleAlaValAlaGluGlyThrAspArgVallle 

GluValValGlnGluAlaTyrArgAlalleArgHisIleProArgArglleArgGlnGlyLeuGluArg 
IleLeuLeu 



is used.

3. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



ValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr 
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn 
METTrpLysAsnAspMEWalGluGlnHETHisGluAspIlelleSerLeuTrpAspGlnSerLeuLys 
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 
AsnSerSerSerGlyArgMETIleMETGluLysGlyGLuIleLysAsnCysSerPheAsnlleSerThr 
SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIlelleProIleAspAsn 
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysValSer 
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylleArgProValVaL 
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGLuGluGluValVallleArgSerValAsnPheThr 
AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGLuIleAsnCysThrArgProAsnAsn 
AsnThrArgLysLysIleArglLeGLnArgGlyProGlyArgAlaPheValThrlleGlyLysIleGly 
AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrl-euLysGlnlleAlaSer 
LysLeuArgGluGLnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGLyGLyAspProGlu 
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
ProCysArglleLysGlnPhelleAsnMETTrpGlrKSluValGlyLysAlaMETTyrAlaProProIle 
SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GIuLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGlnAlaArgGLnLeuLeuSerGlylleValGLnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 
AlaArglleLeuAlaVaLGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsriHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 

is used.

4. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

CysProLysValSer

PheGlu^roILeProILeHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnuysThr 

PheAsnGlyThrGlyProCysThrAsnValSerThrValGInCysThrHisGlylleArgProValVal 

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValVallleArgSerValAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsriAsri 

AsnThrArgLysLysIleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLysIleGly 

AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsr^snLysThrllellePheLysGlnSerSerGLyGlyAspProGlu 
IleValThrHisSerPheAsnc^sGlyGl^ 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
PrcX^sArgIleLysGlru?heIleAsnM^^^ 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLe^^ 

AsnGLySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLyslleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGlylLeGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGlnAl^ 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGLnLeuGln 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrALaValProTrpAsnALaSerTrpSerAsnLysSerLeuGIuGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 

is used.

5. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



METArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



is used. 

6. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



METTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

7. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

METArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

8. A process according to any one of claims 1 to 7 
wherein the host cell is a bacterium. 

9. A process according to claim 8 wherein the
bacterium is E. coli.

10. A process according to claim 9 wherein the 
expression vector is a plasmid of the pEV/env 
family. 

11. A process according to claim 10 wherein the
plasmid is pEV1, -2, or -3/env 44-640.

12. A process according to claim 10 wherein the
plasmid is pEV1, -2, or -3/env 205-640.

13. A process for the preparation of an expression
vector comprising a gene coding for an envelope
protein of an AIDS virus, which process comprises
constructing an expression vector having an insertion
site wherein said gene may be inserted which
insertion site is downstream of a promoter sequence
enabling transcription, translation and thus
expression of said envelope protein in a host cell.

14. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence

GTGTGGAAGGAAGCA
ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used.

15. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence

TGTCCAAAGGTATCC

TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



or an equivalent thereof is used. 

16. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 



ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



or an equivalent thereof is used. 



17. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence

ATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used. 



18. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 



ATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT



or an equivalent thereof is used. 
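
The nucleotide sequences recited in the process claims and the amino acid sequences recited for the same fragments (for example the sequence of claim 7) can be compared by translating the DNA in frame. The following is a minimal Python sketch of such a check, assuming the standard genetic code; the names `translate`, `CODON_TABLE` and the other identifiers are illustrative and do not appear in the specification.

```python
# Standard genetic code, indexed by codon with bases ordered T, C, A, G.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Three-letter names in the style of this document (methionine is written "MET").
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "MET", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
    "*": "***",  # stop codon
}

# Map each of the 64 codons to a three-letter residue name.
CODON_TABLE = {
    b1 + b2 + b3: THREE_LETTER[ONE_LETTER[16 * i + 4 * j + k]]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA, read in frame from its first base, into three-letter residues."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[p:p + 3]] for p in range(0, usable, 3))


# First line of the nucleotide sequence of claim 18.
print(translate("ATGAGGGACAATTGGAGAAGTGAATTATATAAA"))
# prints METArgAspAsnTrpArgSerGluLeuTyrLys
```

An incomplete trailing codon, such as the single final T of the sequence of claim 18, is ignored by this sketch rather than reported.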

19. A process according to any one of claims 13 to
18 wherein the expression vector is a plasmid
capable of replication in gram-negative bacteria.

20. A process according to claim 19 wherein the 
plasmid is capable of replication in an E. coli strain. 

21. A process for the preparation of a transformant
carrying an expression vector comprising a gene 
coding for an envelope protein of an AIDS virus, 
which process comprises transforming a microor- 
ganism with an expression vector obtained accord- 
ing to any one of claims 13 to 20 and cultivating 
the transformed microorganism. 

22. A process according to claim 21 wherein the 
microorganism is an E. coli strain. 

23. A process according to claim 22 wherein the 
microorganism is an E. coli MC 1061 strain. 

24. A process of testing human blood for the presence
of antibodies to the viral etiologic agent of
AIDS which process comprises mixing a composition
containing an envelope protein of an AIDS virus
obtained according to any one of claims 1 to 12
with a sample of human blood and determining
whether said envelope AIDS protein binds to AIDS
antibodies present in the blood sample.

25. A process according to claim 24 which com- 
prises the use of the Western Blotting Analysis. 

26. A process according to claim 24 which comprises
the use of an ELISA technique, wherein an
envelope protein of an AIDS virus obtained according
to any one of claims 1 to 12 is coated on a
solid phase and contacted with the sample and
after washing contacted with an enzyme-labeled
non-human IgG.

27. A process according to claim 24, wherein the
Double-Antigen-Method is used.

28. A process for the determination of AIDS virus, 
wherein antibodies against an envelope protein of 
an AIDS virus obtained according to any one of 
claims 1 to 12 are used. 

29. A process according to claim 28, wherein the
antigen in the sample and a protein obtained according
to any one of claims 1 to 12 in labeled
form compete with an antibody against a protein
obtained according to any one of claims 1 to 12.

30. A process according to claim 28, wherein a 
sandwich method is performed using two anti- 
bodies against a protein obtained according to any 
one of claims 1 to 12. 

31. A method according to claim 30, wherein one 
antibody is on a solid phase and the other antibody 
is labeled. 

32. A method according to claim 30, wherein two 
different monoclonal antibodies are used. 

33. An envelope protein of an AIDS virus whenever 
prepared by a process as claimed in any one of 
claims 1 to 12. 

34. An envelope protein of an AIDS virus according
to claim 33 that is a homogeneous protein free of
other AIDS viral proteins.

35. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus whenever
prepared by a process as claimed in any one of
claims 13 to 20.

36. A transformant carrying an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus whenever prepared by a process
as claimed in any one of claims 21 to 23.

37. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus downstream
of a promoter sequence enabling transcription,
translation and thus expression of said envelope
protein in a host cell.

38. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



GTGTGGAAGGAAGCA 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

39. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

TGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

40. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

41. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof. 

42. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT



43. An expression vector according to any one of 
claims 37 to 42 which is a plasmid capable of 
replication in gram-negative bacteria. 

44. An expression vector according to claim 43 
which is capable of replication in an E. coli strain. 

45. An expression vector according to claim 43 or 
44 which is a member of the pEV/env family. 

46. An expression vector according to claim 45
which is pEV1, -2, or -3/env 44-640.

47. An expression vector according to claim 45
which is pEV1, -2, or -3/env 205-640.

48. A transformant carrying an expression vector as 
claimed in any one of claims 37-47. 



49. A transformant according to claim 48 which is 
an E. coli strain. 

50. A transformant according to claim 49 which is 
an E. coli MC 1061 strain. 

51. Antibodies raised against a protein obtained
according to any one of claims 1 to 12 and 33 and
34.

52. The antibodies of claim 51 which are monoclonal
antibodies.

53. A vaccine eliciting immunity to AIDS comprising
as an active ingredient a protein obtained according
to any one of claims 1 to 12 and 33 and 34.

54. The use of a protein as claimed in any one of
claims 1-8 for the preparation of a protective
immunisation vaccine.

FIGURE 1



1 ATTCTGCAACAACTGCTGTTTATCCATTTTCAGAATTGGGTGTCGACATAGCAGAATAGGCGTTACTCG 69 
70 ACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGC 138 
139 CTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAA 207 
208 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGA 276
277 CTCATCAAGTTTCTCTATCAAAGCAGTAAGTAATACATGTAATGCAACCTATACAAATAGCAATAGTAG 345 
346 CATTAGTAGTAGCAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGGAAAATAT 414 
415 TAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGAAAGAGCAGAAGACAGTGGCAATGAGA 483 
484 GTGAAGGAGAAATATCAGCACTTGTGGAGATGGGGGTGGAGATGGGGCACCATGCTCCTTGGGATGTTG 552 
553 ATGATCTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAAGGAAGCA 621 
622 ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 690 
691 CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 759 
760 ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 828 
829 CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 897 
898 AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 966 
967 AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 1035 
1036 GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 1104
1105 TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 1173 
1174 TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 1242 
1243 TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 1311 
1312 GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 1380 
1381 AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 1449 
1450 AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 1518 
1519 AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 1587 
1588 ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 1656 
1657 ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 1725 
1726 CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 1794 
1795 AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 1863 
1864 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 1932 
1933 TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 2001 
2002 GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 2070 
2071 GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 2139 
2140 TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 2208 
2209 GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 2277 
2278 AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 2346 
2347 AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAATT 2415 
2416 GAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTG 2484 
2485 TGGAATTGGTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTG 2553 
2554 GTAGGTTTAAGAATAGTTTTTGCTGTACTTTCTGTAGTGAATAGAGTTAGGCAGGGATATTCACCATTA 2622 
2623 TCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGA 2691 
2692 GAGAGAGACAGAGACAGATCCATTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTGCGG 2760 
2761 AGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTT 2829 
2830 CTGGGACGCAGGGGGTGGGAAGCCCTCAAATATTGGTGGAATCTCCTACAATATTGGAGTCAGGAGCTA 2898 
2899 AAGAATAGTGCTGTTAGCTTGCTCAATGCCACAGCTATAGCAGTAGCTGAGGGGACAGATAGGGTTATA 2967 
2968 GAAGTAGTACAAGAAGCTTATAGAGCTATTCGCCACATACCTAGAAGAATAAGACAGGGCTTGGAAAGG 3036 
3037 ATTTTGCTATAAGATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTGCTGTAAGGGAAAG 3105
3106 AATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGA 3156 
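
The plasmid designations "env 44-640" and "env 205-640" can be related to the nucleotide numbering of Figure 1 with a little arithmetic. The sketch below assumes that the envelope reading frame begins with the ATG at nucleotides 478-480 of Figure 1 (ATG AGA GTG, matching the MET Arg Val with which Figure 6A begins); this starting position is read from the figure for illustration and is not stated in the text. The function names are hypothetical.

```python
ENV_FIRST_BASE = 478  # assumed position of the A of the initiating ATG in Figure 1


def residue_to_bases(residue, first_base=ENV_FIRST_BASE):
    """Return the (first, last) Figure 1 nucleotide numbers of a residue's codon."""
    start = first_base + 3 * (residue - 1)
    return start, start + 2


def line_is_consistent(line):
    """Check that a numbered Figure 1 line holds exactly end - start + 1 bases."""
    parts = line.split()
    start, sequence, end = int(parts[0]), "".join(parts[1:-1]), int(parts[-1])
    return end - start + 1 == len(sequence)


print(residue_to_bases(44))   # (607, 609): the GTG that opens GTGTGGAAGGAAGCA
print(residue_to_bases(205))  # (1090, 1092): the TGT that opens TGTCCAAAGGTATCC
print(residue_to_bases(640))  # (2395, 2397): the final AGC of the claimed sequences
```

The same numbering gives a quick integrity check on each printed line of the figure: a full line runs 69 nucleotides, that is, 23 codons.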
FIGURE 2 (3 pages)

1 50 

HXB-3 MRVKEK YQHLWRWGWRWGTMLLGMLMICSATEKLWVTVYYGVPVWKEATT 

BH-10 

BH-8 F 
LAV K I 

ARV-2 K --GTRRN - 

51 100 

HXB-3 TLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDM

BH-10 

BH-8 

LAV 

ARV-2 R G N 

101 150 

HXB-3 VEQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLKNDTNTNSSSGRMIME

BH-10 
BH-8 

LAV G A ■ NTNSS E M 
ARV-2 Q T N G A NWKEEI 

151 200 

HXB-3 KGEIKNCSFNISTSIRGKVQKEYAFFYKLDIIPIDND--TTSYTLTSCNTSV

BH-10 

BH-8 K 
LAV 

ARV-2 T D I N L RN W AST N NYRLIH R 

201 250 

HXB-3 ITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHG

BH-10 

BH-8 

LAV A 
ARV-2 T K 

251 300 

HXB-3 IRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPN
BH-10 A Q 

BH-8 D 
LAV A Q 

ARV-2 I D N E A 
301 350

HXB-3 NNTRKKIRIQRGPGRAFVTIGKIGNMRQ-AHCNISRAKWNATLKQIASKLR
BH-10 S N D 

BH-8 D 
LAV S 

ARV-2 S Y -- H T R IGDIRK Q N E VK 



351 400 

HXB-3 EQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTW

BH-10 

BH-8 

LAV 

ARV-2 V N MR T N -RLNH 



401 450 

HXB-3 STEGSNNTEGSDTITLPCRIKQFINMWQEVGKAMYAPPISGQIRCSSNIT

BH-10 K I 

BH-8 K I 
LAV 

ARV-2 - K N I I G S 



451 500 

HXB-3 GLLLTRDGG-NNNNGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK
BH-10 - S E 

BH-8 - S E 

LAV 

ARV-2 T VT DT V II 



501 - 550 

HXB-3 AKRRVVQREKRAVGI-GALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQ

BH-10 

BH-8 

LAV - R 

ARV-2 V M V L 

551 600 

HXB-3 QQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSG

BH-10 G 

BH-8 

LAV 

ARV-2 V R 



0 199 301 



601 650 

HXB-3 KLICTTAVPWNASWSNKSLEQIWNHTTWMEWDREINNYTSLIHSLIEESQ
BH-10 NM 
BH-8 NM 
LAV NM 

ARV-2 D DNM Q E D NT YT 

651 700 

HXB-3 NQQEKNEQELLELDKWASLWNWFNITNWLWYIKLFIMIVGGLVGLRIVFA 

BH-10 

BH-8 

LAV I 
ARV-2 S I 



701 750 

HXB-3 VLSVVNRVRQGYSPLSFQTHLPIPRGPDRPEGIEEEGGERDRDRSIRLVN
BH-10 

BH-8 I N 

LAV I T 

ARV-2 I R V D V D 



751 800 

HXB-3 GSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLL
BH-10 

BH-8 

LAV 

ARV-2 FE R AATIH S 



801 850 

HXB-3 QYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQEAYRAIRHIPRRIRQG

BH-10 G 

BH-8 N LA 

LAV G C 

ARV-2 I W T A R L H 



851 856 

HXB-3 LERILL 

BH-10 

BH-8 

LAV 

ARV-2 L 

"-" designates a deletion of one amino acid. An empty
position denotes identity with the HXB-3 sequence.
Figure 5

FIGURE 6A



METArg
ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu
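
Figure 6A gives the envelope protein in three-letter code, whereas the HXB-3 row of Figure 2 gives it in one-letter code. A minimal Python sketch for converting between the two notations follows; the names `THREE_TO_ONE` and `to_one_letter` are illustrative and are not taken from the specification.

```python
# Three-letter to one-letter amino acid codes (standard IUPAC assignments).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_sequence):
    """Convert a run of three-letter residue names (e.g. 'METArgVal') to one-letter code."""
    compact = "".join(three_letter_sequence.split())
    if len(compact) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # capitalize() turns the document's 'MET' into 'Met' before the lookup.
    return "".join(
        THREE_TO_ONE[compact[p:p + 3].capitalize()]
        for p in range(0, len(compact), 3)
    )


# The opening residues of Figure 6A.
print(to_one_letter("METArg ValLysGluLysTyrGlnHisLeuTrp"))
# prints MRVKEKYQHLW
```

A residue name that is not one of the twenty standard codes raises a `KeyError`, which is a convenient way to spot characters misread during transcription.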
FIGURE 6B



AMINO ACID DISTRIBUTION 
OF AIDS ENV PROTEIN 



    Name                         Number of

A   Alanine                      47
B   Aspartic Acid-Asparagine      0
C   Cysteine                     21
D   Aspartic Acid                27
E   Glutamic Acid                49
F   Phenylalanine                26
G   Glycine                      58
H   Histidine                    14
I   Isoleucine                   63
K   Lysine                       44
L   Leucine                      83
M   Methionine                   17
N   Asparagine                   60
P   Proline                      29
Q   Glutamine                    42
R   Arginine                     52
S   Serine                       57
T   Threonine                    60
V   Valine                       56
W   Tryptophan                   31
Y   Tyrosine                     20
Z   Glutamine-Glutamic Acid       0
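
As a consistency check, the counts of Figure 6B can be summed and compared with the length of the envelope protein, whose last residue is numbered 856 in Figure 2. The short Python sketch below does this; the dictionary name `FIGURE_6B` is illustrative, and the values are those tabulated above.

```python
# Residue counts as listed in Figure 6B (one-letter code -> number of residues).
FIGURE_6B = {
    "A": 47, "B": 0, "C": 21, "D": 27, "E": 49, "F": 26, "G": 58,
    "H": 14, "I": 63, "K": 44, "L": 83, "M": 17, "N": 60, "P": 29,
    "Q": 42, "R": 52, "S": 57, "T": 60, "V": 56, "W": 31, "Y": 20,
    "Z": 0,
}

total = sum(FIGURE_6B.values())
print(total)  # 856, the number of the last residue in Figure 2
```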